Template fitting and the large-angle CMB anomalies 
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We investigate two possible explanations for the large-angle anomalies in the Cosmic Microwave 
Background (CMB): an intrinsically anisotropic model and an inhomogeneous model. We take as 
an example of the former a Bianchi model (which leaves a spiral pattern in the sky) and of the 
latter a background model that already contains a non-linear long-wavelength plane wave (leaving 
a stripy pattern in the sky) . We make use of an adaptation of the "template" formalism, previously 
designed to detect galactic foregrounds, to recognize these patterns and produce confidence levels 
for their detection. The "corrected" maps, from which these patterns have been removed, are free 
of anomalies, in particular their quadrupole and octupole are not planar and their intensities not 
low. We stress that although the "template" detections are not found to be statistically significant 
they do correct statistically significant anomalies. 



I. INTRODUCTION 

Since the release of the first year data from NASA's 
Wilkinson Microwave Anisotropy Probe there have 
been extensive studies of the Cosmic Microwave Back- 
ground (CMB) maps. Generally the results impressively 
support the favoured ACDM cosmological model, though 
a lack of power on the larg est scales has produced much 
debate (see e.g., d M, Q E H Of further 

interest are studies that have detected evidence for sig- 
nificant departures from the fundamental cosmological 
assum ptio n that the universe is isotropic and homoge- 
neous pi e h h m m m m m, m m . Th US 

it appears the large angle CMB multipoles are anoma- 
lous in two seemingly distinct ways. Firstly their power 
Ci is abnormally low. Secondly they have an improbable 
directionality revealed by the fact that for a certain z- 
axis' orientation one m-mode absorbs most of the power, 
for I = 2, 5 [35| . Naively one may expect the two fea- 
tures to be related: the absence of power in all possible 
m modes may be why the observed average Ci is so low. 
But this has to be made rigorous. Such is the purpose of 
this letter: could it be that accounting for the large-angle 
directional multipoles accounts for the observed low C{1 
ACDM theory matches the power spectrum very well 
on most scales |3(, but the strong non-Gaussianity and/or 
anisotropy of the low multipoles suggests a competition 
between two processes: Gaussian isotropic random fluc- 
tuations and a deterministic process. One possibility for 
the latter is intrinsic cosmic anisotropy (as encoded in 
the Bianchi models Vwhich imparts a spiral pattern in the 
CMB as studied in 0,11, HI Hi Another is a background 
model that already contains a non-linear long-wavelength 
plane wave. This would leave a striking stripy pattern in 
the sky (a "Jupiter" , if the polar axis is defined suitably). 
The matter may then be addressed by regarding these 
patterns as a "contamination" template, and attempting 
to "correct" the data for its coupling to the template, 
thereby purifying the underlying Gaussian process. The 
formalism has been developed to deal with galactic fore- 
grounds but may be adapted to this situation. 



We therefore seek a template capable of explaining the 
preferred axis in the CMB, and ask the question: could 
the corrected map reveal higher Cgs, thus linking the two 
issues? Note that this is highly non-trivial. Usually when 
one corrects a map for a "foreground" the power in the 
corrected map is smaller. However it could also be that 
the deterministic process acted destructively. For this to 
be possible the template would have to have a pattern 
complementary to the observed data, i.e., contain power 
in (at least some of the) ms which fail to reveal it in the 
data; in addition this power would have to have acted 
destructively upon the underlying Gaussian process, so 
that only one m mode survived in the data. 

The presence of a template is more likely to add power 
than to remove it, so to explain the low-£ low power this 
way one relies in part on a chance alignment of phases be- 
tween the Gaussian fluctuations and the template. Still, 
in this paper we explore this approach with two different 
models. In both cases, once we correct the data, and are 
left with a map with higher low-multipole power Ce, and 
no preferred direction as seen in |35| . 

In Section [H] we outline our method for selecting 
favoured templates. Then iii IIIII we introduce the tem- 
plates of our two specific models, and in Section llVI we 
explain how we selected the data and record our results. 
We discuss the implications of these in SectionslVland lVll 



II. THE FORMALISM 

Our work is modelled on that of |29| but with an impor- 
tant extension. We start by reviewing |2^. Suppose the 
sky d may be modelled as the sum of a Gaussian process 
g and a non-Gaussian "template" , t. Then the Maximum 
Likelihood Estimation (MLE) problem may be set up by 
introducing a coupling parameter a so that d = g + at. 
The usual MLE problem now concerns d — at, that is: 
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(we have ignored an additive constant in the first expres- 
sion). Here M is the covariance matrix of g (which may 
include noise) and the data vectors may be either tem- 
peratures in pixel space, or a set of spherical harmonic 
coefficients ae m . In the latter case the data is complex, 
satisfying "reality" constraints; the concomitant modifi- 
cations to the formalism are trivial. For full sky maps 
the aim are the preferred formulation. 

In general we have a joint estimation problem for a and 
the unknowns parameterizing M (which can be binned 
Ce or the half-dozen parameters encoding "cosmic con- 
cordance"). However it is often the case that the range of 
is which decides the M problem and that for the template 
are disjoint. For example the geometry of the Universe O, 
or the amount of baryonic matter Qi, are mainly decided 
by the Doppler peaks, whereas we are interested in tem- 
plates producin g la rge-angle non-Gaussian features (say 
I = 2 — 5; see |35|1. One may then decouple the two 
problems: find M ignoring the template and then solve 
for a using this solution for M. 



Parameter Estimation 



The MLE problem for a then becomes the problem of 
2 The solution is 
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By construction a is a Gaussian variable. Its variance is 
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Two comments are in order. Firstly notice that this for- 
malism wants the template to be like the data. If a mys- 
terious theory was found predicting t = d, then a = 1 
and the MLE problem tells us that the Gaussian process 
is in fact absent |53| . This is not in line with common 
sense: we do know that there is a Gaussian process. Sec- 
ondly this procedure always produces a corrected map 
d — at with lower chi-squared than the original map d. 
This is fine if one suspects contamination because of an 
abnormally high chi-squared, but what if the observed 
chi-squared is too low? In that case the corrected map is 
even more abnormal. 

As is usually the case, these quibbles boil down to the 
choice of parameters and priors used. We have failed 
to impose a suitable condition enforcing the presence of 
a Gaussian process, so that the corrected maps have a 
reduced chi-squared close to 1. If we are in ai m space, 
for example, we are assuming a uniform prior in ag m - The 
peak of the distribution is always at ag m = 0, so if t is 
such that this condition can be satisfied that is precisely 
what the formalism does (a zero chi-squared is the highest 
probability configuration if one assumes uniform priors 
in aim). Seen in another way, if we have a deterministic 
template competing with a Gaussian process on equal 



footing, the formalism will always try to suppress the 
random Gaussian process in favour of the certainties of 
the template. 

To address this issue we use an alternative function 
of the data in calculating the likelihood. We base the 
likelihood on the probability distribution of the ln(x 2 ) 
(this rather than \ 2 produces the correct power spectrum 
estimator) . By changing variables the likelihood is given 
by 



-2 In C = x 2 + In det M + D ln(x 2 



(5) 



where D is the number of degrees of freedom (i.e. the 
number of pixels, or the number of ai m being considered). 
Its maximization with respect to a now leads to: 
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i.e. either the first factor is zero (equivalent to the prob- 
lem of minimizing the x 2 solved above), or % 2 = D, 
which means the reduced chi-squared is 1. This is simple 
to solve, and involves finding the roots to the quadratic 
equation |J2J. The solution is 



X 2 dt ±Vx 4 dt ~X 2 t(xi-D) 
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This formalism for correcting the map generalizes that 
of |29| . while reducing to it in the appropriate circum- 
stances. If the quadratic does not have real solutions 
for a (because the minimum of x 2 ( a ) is above D) then 
the problem reduces to |29j|. This is the case where the 
chi-squared of the data is too high, so that correcting it 
entails reducing it. In this case the two formalisms agree. 
However if the data's chi-squared is too low then our for- 
malism takes over. The quadratic then has real solutions, 
and the MLE problem is solved by solutions a± given by 
0. 

We note that this approach could be conceptually im- 
proved by factorizing the likelihood £ by £ and imposing 
uniform priors on an appropriate function of each x\- 
Minimising the likelihood with respect to a would now 
lead to the problem of solving 



(11) 



However, this is in practice impossible to solve, especially 
for large ^ max . Therefore, we are solving for the simpler 
case of x 2 = J2eX 2 e- 
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B. Model Comparison 

We have found the necessary correction to make for 
a given template. Perhaps more important is to find 
the preferred template - i.e., the preferred model. The 
Bayesian approach to model comparison is to marginalise 
over the parameters of a model, to find the probability of 
data given the model (whatever the parameters), as this 
relates to the probability of the model given the data. We 
have been working with P(a, t), and above we discuss the 
specific a solutions that maximise this for a given tem- 
plate. We now wish to compare the likelihood of different 
templates, that is we require just P(t) - a marginaliza- 
tion over a. The standard approach to this (see for e.g 
|47|) is to absorb the uncertainties of a into the co- 
variance matrix. That is 

X 2 =d T M' *d (12) 

for 

M' _1 = lim (M + cr 2 t T t)- 1 (13) 

(J — >oo 

where M is the usual covariance matrix associated with 
the ai m s as used above. Using the Sherman-Morrison- 
Woodbury [23] formula we find 

M' _1 = M _1 — M~ 1 t(t T M~ 1 t)t T M~ 1 (14) 
and therefore 

X 2 =X 2 d - {2 ^^X 2 d -T 2 (15) 

Assuming uniform priors on our templates, our model 
comparison now involves maximising the likelihood (jSJ) 
using the new expression for x 2 an( i the matrix M'. 
In [43 there is a discussion of how this method of 
marginalising over a inside the covariance matrix is 
equivalent to ignoring the data that correlates with the 
template when working out the x ''■ 

The likelihood function (J3J) has its maximum at x 2 = 
D. The effective chi-squared from a template is given by 
Eq. (|15fl and we see it is always less than xi- Therefore, if 
the initial chi-squared of the data is low: X 2 < X 2 < D, 
the preferred template is that which maximises the x 2 
(minimises T) so as to bring it as close to x\ as possible, 
the optimal solution being r = 0. Conversely, if the data 
has a high chi-squared our formalism reduces to that of 
p9|. where we maximise T so as to reduce the x 2 to as 
close to D as possible. 



III. THE TEMPLATES 

We consider two prototypes of violation of isotropy and 
homogeneity: Bianchi models and plane- wave inhomoge- 
neous models. 



A. Bianchi models 

Bianchi models are well-known homogeneous, 
anisotropic generalizations of Friedmann-Robertson- 
Walker models. They may be used as a competitor to 
standard cosmology, constraining violations of isotropy 
while still assuming homogeneity. In these models the 
CMB is never isotropic, even before fluctuations are 
added on. Of particular interest are Bianchi Vllh 
models, for which an asymmetric spiral pattern is 
imprinted upon the sky. These models were extensively 
examined in [2^| with reference to asymmetries in the 
power spectrum and anomalous hot-spots. We have 
reproduced these results with our programs. One of 
the deficiencies of the fits found in [23 (recognized by 
the authors) is that they require Q ^ 1. This would 
almost certainly lead to a bad fit at high £ even if the 
large-angle corrected map is improved. For this reason 
here we shall restrict ourselves to fi = 1 models. 

We follow the parametrisation of the Bianchi 
Vllh model from a 2j. These parameters are Q, 
x = y/h/(l — Ci), handedness, and the Euler angles 
(<f>, 8, ifi) |5_4j. We refer the reader to [2| for a fuller ex- 
planation. We fit templates with f2 = 1, and explore the 
range x S [0.1, 10], with both left and right handedness, 
over the total range of Euler angles. 

B. Plane- wave cosmology 

Inhomogeneous cosmologies are either very simple or 
very complicated [12 ■ We consider a model in which a 
plane wave in the gravitational potential $ is part and 
parcel of the background model (i.e. the unperturbed, ze- 
roth order cosmological model, before the standard Gaus- 
sian isotropic scale-invariant adiabatic fluctuations are 
added to it). 

An outstanding wave might be the result of several 
processes. It could be for example the hallmark of non- 
trivial topology [27l I45L l52| . Non- trivial topologies may 
be ruled out on the grounds that they would imprint re- 
peat patterns in the CMB sky, e.g. matching circles in 
antipodal locations for the most basic "slab topology". 
This constraint is only applicable if the fundamental do- 
main is smaller than the last scattering surface. 

Larger-than-the-horizon domains have two effects 
upon the density plane waves riding the homogeneous 
model. Firstly modes must fit into the domain, leading 
to a discretization. But they also introduce a scale in 
the problem, the size of the domain, L. Hence the usual 
naive arguments for scale-invariance no longer apply and 
fc 3 <j> 2 m ay actually be a function of kL without intro- 
ducing an arbitrary scale in the problem. We speculate 
that even if L is sufficiently large that one may neglect 
mode discretization inside the horizon, the fundamental 
mode k = 2n/L will be very intense, and dominate over 
whatever underlying (scale-invariant) Gaussian process. 
Interestingly in |27| it was shown that multipole align- 
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22.4 


58.6 


59.4 



TABLE I: The Axis of Evil behaviour before and after tem- 
plate corrections. The n (in galactic coordinates (l,b)) and 
m are listed for the low multipoles I = 2 — 5. The average 
inter-angle of the 4 axis is shown. The uncorrected map finds 
a strong alignment of axis, shown by the low average inter- 
angle, (see Section Hvl and Equation 1171 ) 

merits, like those observed 0, 0], are produced in 
certain non-trivial topologies. 

Another context in which strong long - wavelength 
modes have been studied is the work of [3l|. Here (con- 
troversial) claims were made that super horizon scale 
fluctuations may be causing the acceleration of the uni- 
verse. It would be interesting to investigate further the 
exact length/intensity ratio required for such a model to 
work (should it work at all). The impact upon the CMB 
would then fall under the present study. 

Regardless of these two possible motivations it is in- 
teresting to examine the evidence for one such wave 
in the CMB, and it is this more phenomenological ap- 
proach that will be followed in our paper. We will search 
for evidence of a dominating k-mode with wavelength 
A £ [0.5, 10] in units of the diameter of the last scatter- 
ing surface. Our parameters for this model are the wave- 
length A, the phase p, and the Euler angles {(f), 8, tp) (we 
actually do not need to concern ourselves with the first 
rotation <fi about the z-axis as the wave is cylindrically 
symmetric) . 



IV. RESULTS 

We use full sky maps, specifically we look at the 
cleaned map of [4£| as this map has been shown to have 
interesting features |35l ] . In our covariance matrix we 
include only the theoretical power spectrum terms from 
WMAP |3(, and ignore noise (we are considering only 
the low--fe). We include a 24° beam, and consider the 
£-range: 2-5 (32 degrees of freedom). For this I range 
this map finds a x\ — 27.06 < 32, and so the formalism 
outlined in Section^rcquires that we select the template 
that minimizes T. 

We use this ^-range for 2 reasons. Firstly, two sig- 
nificant and different anomalies have been seen on these 
scales; a preferred direction and low power. Secondly, 
this is the relevant £-range for the templates we are ex- 
amining. Considering an I range not covered by a tem- 
plate is pointless when the purpose is to assess the effect 
a template will have on the map. 
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FIG. 1: The Axis of Evil behaviour, before (solid line) and 
after template corrections (Bianchi corrected (dotted), Wave 
corrected (dashed)). See Equation 1171 The top panel shows 
the ratio of power in the preferred m, the second panel shows 
the m, and the bottom 2 panels display the axis n in galactic 
coordinates (l,b). 

We then minimize T over the range of parameters for 
both template types. It turns out that in neither case 
do we find one clear optimal solution: there are numer- 
ous templates that find r w within the limitations of 
computational accuracy and the rotation grid. This is 
due to the small £-range but also due to the fact that the 
observed data is such a bad fit, with a very low \ 2 ■ Note 
that we are limited to this narrow ^-range by our choice 
of models as this is where their power lies. 

Given the degeneracy of our solution we have to impose 
an additional condition to select the preferred templates. 
Considering that the full (unsolvable) problem requires 
solving i|ll[) we may break the degeneracy by evaluating 
the xj an d defining 

A=E(sfr-0 J (16) 

By minimising A we find the solution of the abridged 
problem that's closest to the solution to the full prob- 
lem. Then the corrected map will have the power spec- 
trum closest to the expected power spectrum line. This 
also breaks the degeneracy between the two a solutions 
associated with each template (see equation 0). 

We find that our preferred Bianchi template has 
x = 1.4 with left handed vorticity, and Euler angles 
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FIG. 2: The map (mK) before (top) and after (bottom) it 
has been corrected for the preferred Bianchi template (mid- 
dle). The template's power has been doubled for illustration 
purposes. 



(cf>,8,tfj) = (—82,72,-62), and therefore an axis in the 
direction (I, b) — (—62, 18) in galactic coordinates. We 
find the preferred large Wave template has A = 0.96 and 
p = 44°, in the direction (I, b) = (—38, 30) (equivalently 
a wave in the opposite direction with (I, b) = (142, —30) 
and phase ir — p). That is, the preferred wave is just 
slightly smaller than the size of the CMB. 

We examined the "Axis of Evil" behavior of the cor- 
rected maps |35j , and results are in Table |I|and Figure □ 
We define a direction for each multipole by finding the 
m and the direction n that maximises 

{21 + l)C e 

where Ciq = |a£o| 2 , C^ m = 2\a( m \ 2 for m > (notice 
that 2 modes contribute for m ^ 0) and (21 + l)Ci = 
J2 m \ a £m\ 2 - This finds the axis for which a multipole is 
most dominated by just one m-mode. As can be seen 
in Table [I] the uncorrected map finds similar directions 
for the I = 2,3,4,5 multipoles (as reported in j3f|) - 
a significant departure from isotropy. We see that the 
corrected maps find no such anisotropic behaviour. The 



FIG. 3: The map (mK) before (top) and after (bottom) it has 
been corrected for the preferred Wave template (middle) . The 
template's power has been doubled for illustration purposes. 

average inter-€ angles found are perfectly consistent with 
isotropy as compared to simulations. 

In Figures [21 and |3we plot the map before and after 
the correction, only showing the multipoles I — 2 — 5. 
In Figure 0] we plot the much examined Quadrupole and 
Octopole before and after the corrections. The alignment 
0, Q, ^|| can clearly be seen in the before maps, and 
not in the after maps, also the anomalously low power is 
increased, see Figure [5] 

We have therefore proved that all known large-angle 
anomalies may be coupled. There are models providing 
templates which can correct, in one go, all known anoma- 
lies - the low power and the "axis of evil" effect. Whether 
or not the evidence for these models is high is another 
matter, which we now proceed to examine. 



V. SIGNIFICANCE 

There are many ways to quantify the significance of 
the final correction. As already noted in Section the 
expected power spectrum of a map will be higher than 
that for a purely Gaussian map if we assume the pres- 
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FIG. 4: The Quadrupole, £ = 2 (above), and the Octopole , I = 3 (below), for the uncorrected (left) and Bianchi corrected 
(middle) and the Wave corrected (right) maps. The apparent alignment is not present in the corrected maps (mK units). 





Bianchi 



Wave 



FIG. 5: The power spectrum (mK 2 ) of the map before (solid line) and after (long-dashes) being corrected for the preferred 
template (dotted). The ACDM theoretical power spectrum from WMAP is also plotted (short-dashes). 



ence of a template. Thus, when considering the low-£ low 
power, if we take into account the presence of a template, 
then we actually make the situation worse! We consider 
simulations of Gaussian maps with the added template, 
for both the Bianchi and the Wave models. We calcu- 
late power spectrum estimator for 5000 simulations and 
indeed we find that the without adding a template, only 
2.6% of simulations find a C2 lower than that observed 
by WMAP. And with the Bianchi template added this 
reduces to 1.2%, thus making the fit worse. 

However in apparent contradiction to this we have a 
map that, when corrected for the preferred template, 
finds excellent consistency with the theoretical power 
spectrum. So, we ask a different question here. We al- 
low 1000 simulations to find their best template, for both 



model types. We record the x\ before the correction, and 
the a, , A of the correction, and we only look at those 
simulations that find a low initial Xd (525 of our 1000 
simulations). We can now assess the significance of our 
results, given the prior assumption of a destructive align- 
ment between the Gaussian process and the template. 

We find the simulations return similar results to the 
WMAP map. That is we looked at the range of a val- 
ues returned, and various functions of (a, xf , A, \ 2 d ) and 
found our WMAP result to not be particularly anoma- 
lous compared to the simulation results. This was the 
case for both the Bianchi and the Wave templates. 

We therefore must conclude that the detections are not 
significant. However, we find the method still of interest, 
and note it can be used to investigate new models of 
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contamination in the future. We give one example of a 
possible improvement. 

Qualitatively the crux of the issue lies in that we must 
rely on a chance ant i- alignment of phases between tem- 
plate and Gaussian process. We don't see power in some 
m modes because it was there both in the template 
and in the Gaussian process and the two added destruc- 
tively. This will always sound suspicious and we suggest 
that models might be set up where the underlying non- 
Gaussian process influences the production of Gaussian 
fluctuations so that the two processes are anti-correlated. 
Specifically we could have set up a likelihood where the 
X 2 functions for template and Gaussian process added 
(rather than their aim)- However such work is beyond 
the scope of this paper. 

VI. DISCUSSION 

We found templates from two different models that can 
simultaneously explain the observed anisotropic align- 
ments of the \ow-£ multipolcs and their low power. The 
corrected maps show power spectra vastly more consis- 
tent with the standard ACDM theoretical power spec- 
trum |j| than the uncorrected map. The scope of our 
work was two-fold. 

Firstly, we proposed a formalism for dealing with tem- 
plates in the presence of data which has a low chisquared. 
In this approach we promote the x 2 to the status of rele- 
vant function of the data for comparing theory and data. 
This is in contrast with the the usual af m s, and reduces 
the number of degrees of freedom. Furthermore the like- 
lihood problem is then turned on its head in that it en- 
forces the strong prior that a Gaussian process with a 
power known a priori must be present in the corrected 
maps. Thus this approach generalizes the previous one in 
that it reduces to it for cases where the data chisquared is 
too high, but leads to significantly different results when 
the observed chisquared is too low. 

Notice that standard methods for template correction 
will lead to a corrected map with a lower \ 2 ■ This is be- 
cause it is more probable that the presence of a template 
will increase the power than remove it. We are therefore 
always on the back foot when trying to use a template 
to explain a low power spectrum (see 0, C3 f° r a simi- 
lar discussion) . We note that as the current data is such 
a bad fit, most of our templates (^90 %) improved the 
Ci fit before we even imposed the second condition of 
minimising A. 

The second purpose of our paper was to use this for- 
malism to investigate whether the low power (i.e. low 
chisquared) observed in the low multipolcs and the axis 
of evil effect might be related to each other, and whether 



both could be due to an underlying anisotropic or in- 
homogeneous model of the Universe. The axis of evil 
effect consists of the fact that for a given z-axis orienta- 
tion the power in a given multipole is not distributed "at 
random" among the various m modes but concentrates on 
planar models (m = ±£) for £ = 2,3; furthermore the axis 
for which this happens is roughly the same for £ = 2, 5. 
A priori the two effects should be coupled because the Ce 
is an average over m of the variance |ei£ m | 2 . It looks as 
if some m-modes missed their power and that if this was 
reinstated the low Ce and the "axis of evil" anomalies 
would disappear. This might happen after correcting the 
nefarious influence of a deterministic template and it sug- 
gests that the template should have a strong non-planar 
component along the preferred direction. 

We considered Bianchi models and also inhomogeneous 
models where a very strong long- wavelength wave is part 
of the background model. We find the preferred parame- 
ters for these models and correct the maps. The corrected 
maps are free of anomalies, in particular their quadrupole 
and octupole are not planar and their intensities not low. 
Therefore we stress that although the "template" detec- 
tion are not found to be statistically significant they do 
correct statistically significant anomalies. 

While this paper was being finished two preprints ap- 
pear ed prop osing explanations for the axis of evil ef- 
fect [22l |5(J. One of these is rather similar in spirit 
to ours |22fl : the idea that a long wavelength mode 
is imprinted in the sky. Even though our statistical 
treatment is rather different, both papers highlight the 
same difficulty with such explanations: that in general 
waves would add, rather than subtract power. Our sug- 
gested explanation (that an anti-correlation might ex- 
ist between the Gaussian and the deterministic process) 
finds its counterpart in the model considered in 22] in 
the concept of multiplicative non-linear response. We 
find this idea very interesting, but as with our idea of 
Gaussian/template anti-correlation we stress that it may 
not be necessary. The fit between the data and the pure 
Gaussian model is so bad that a fluke anti-alignment 
of phases is already a good enough explanation (even 
though clearly the Bayesian evidence for more complex 
models will always be superior). 

We are also examining the possible correlations be- 
tween the CMB anomalies and the large-scale structure 
and defer to a future publication |3Jj | comments on [5(1 ]. 
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